A Reasonably Language Independent, Heuristic Algorithm for the Marking of Names in Running Texts
نویسنده
چکیده
0. Introduction T h e id en tifica tio n o f n am es an d abbrev ia tions in a ra w te x t is a n im p o rtan t sub ac tiv ity in the "to k en iza tio n " p ro cess , i.e . th e id en tifica tion o f th e b as ic u n its o f th e tex t: parag raphs, sen ten ces a n d w ords. T o k en iza tio n is in its tu rn an im p o rtan t su b ac tiv ity in th e "norm alization" o f tex ts , th e to ta lity o f o p era tio n s and p repara tions a te x t has to u n d erg o b efo re it is su itab le fo r be in g ad d ed to a te x t co rpus.
منابع مشابه
Genetic algorithm for Echo cancelling
In this paper, echo cancellation is done using genetic algorithm (GA). The genetic algorithm is implemented by two kinds of crossovers; heuristic and microbial. A new procedure is proposed to estimate the coefficients of adaptive filters used in echo cancellation with combination of the GA with Least-Mean-Square (LMS) method. The results are compared for various values of LMS step size and diff...
متن کاملAn Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ
An efficient assignment and scheduling of tasks is one of the key elements in effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard is the reason why we used meta-heuristic methods for finding a suboptimal schedule. In this paper we proposed a new approach using TRIZ (specially 40 inventive principles). The basic idea of thi...
متن کاملAn Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ
An efficient assignment and scheduling of tasks is one of the key elements in effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard is the reason why we used meta-heuristic methods for finding a suboptimal schedule. In this paper we proposed a new approach using TRIZ (specially 40 inventive principles). The basic idea of thi...
متن کاملA Literary Anthroponomastics of Three Selected African Novels: A Cross Cultural Perspective
Names as markers of identity are a source of a wide variety of information. This paper explores the names of characters to show the sociocultural factors which influence the choice of names and the effects that the names of these characters have on the roles they play. Using a variety of personal names from Ayi Kwei Armah’s Fragments, Buchi Emecheta’s The Joys of Motherhood, a...
متن کاملDynamics of a Running Below-Knee Prosthesis Compared to Those of a Normal Subject
The normal human running has been simulated by two-dimensional biped model with 7 segments. Series of normal running experiments were performed and data of ground reaction forces measured by force plate was analyzed and was fitted to some Fourier series. The model is capable to simulate running for different ages and weights at different running speeds. A proportional derivative control algorit...
متن کامل